Corpus: vie_news_2022


3.12.9 Problems with sentence segmentation - Words ending in a stopword

Most frequent words ending in a stopword. They usually contain uppercase letters as a result of missing blanks between tokens.

Stopword  Concatenated word  Frequency of stopword  Frequency of concatenated word
Anh       ‘Anh               28644                  8
Anh       dienAnh            28644                  7
Trong     ‘Trong             41937                  5
Theo      ‏Theo              44869                  4
Trong     ‏Trong             41937                  4
Nga       ‘Nga               31087                  3
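
The report does not say how this list is computed. As a rough illustration only, the Python sketch below shows one way such candidates could be found from a word-frequency list: keep every word that is strictly longer than a stopword and ends in it, then sort by the frequency of the concatenated word. The function name `words_ending_in_stopword` and the toy data are assumptions for this example, not part of the corpus tooling.

```python
from collections import Counter


def words_ending_in_stopword(freq, stopwords):
    """Return (stopword, word, stopword_freq, word_freq) rows.

    Hypothetical helper: a word qualifies if it ends in a stopword and is
    strictly longer than it, i.e. something is glued in front of the stopword.
    """
    rows = []
    for word, count in freq.items():
        for stop in stopwords:
            if len(word) > len(stop) and word.endswith(stop):
                rows.append((stop, word, freq.get(stop, 0), count))
    # Most frequent concatenated words first; ties broken alphabetically.
    rows.sort(key=lambda r: (-r[3], r[1]))
    return rows


# Toy frequencies taken from a few rows of the table above.
freq = Counter({
    "Anh": 28644,
    "dienAnh": 7,
    "Trong": 41937,
    "\u2018Trong": 5,  # left single quotation mark fused to "Trong"
})
stopwords = {"Anh", "Trong"}

rows = words_ending_in_stopword(freq, stopwords)
for stop, word, stop_freq, word_freq in rows:
    print(stop, word, stop_freq, word_freq)
```

Comparing against every stopword for every word is quadratic and only suitable for small lists; for a full corpus vocabulary a suffix lookup (for example, a trie over reversed stopwords) would likely be needed.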
166 msec needed at 2023-03-04 03:02